Compiling and analysing a large corpus of online discussions to explore users’ interactions
Abstract
This methodology-focused paper reports how I compiled and analysed a 12-million-word corpus of threaded online discussions by employing the Corpus Workbench tool (CWB; Evert & Hardie, 2011), combining corpus analysis with micro-analysis drawing on the principles of digital Conversation Analysis. The tool not only affords efficient retrieval from the large dataset but also, more importantly, facilitates exploration based on different variables (e.g., topics of discussions, role of internet users, types of postings) and units (e.g., subforums, threads, postings). Examples are presented to illustrate how I used this tool to investigate various aspects of online discussions and to extract threads surrounding a particular topic or language practices for micro-analysis. I propose that users’ interactions in online discussions can be further explored in the field of corpus linguistics using the synergy of the corpus and interactional approach.
Similar resources
Introduction: Compiling and analysing the Spoken British National Corpus 2014
For over twenty years, the British National Corpus has been one of the most widely known and used corpora. It is almost impossible to attend an international corpus linguistics conference such as Corpus Linguistics, ICAME (International Computer Archive of Modern and Medieval English), AACL (American Association for Corpus Linguistics) or APCLC (Asia Pacific Corpus Linguistics Conference) witho...
Analysing and Predicting Recurrent Interactions among Learners during Online Discussions in a MOOC
High attrition rates are one of the biggest concerns in MOOCs. One of the possible causes may be learners’ lack of interaction and low levels of participation in MOOC online discussions. Research to measure and predict recurrent interactions of learners in MOOC online discussions has the potential to yield insight into the likely impact on the attrition rate. It is argued that personalisation...
Analysing online discussions in educational and work based settings
Networked learning is increasingly about the connectivity of learners or professionals with one another and with resources available online, sometimes freely. Researchers are making use of this by designing online environments in which this connectivity and the vast resources available to learners can be exploited. Many online discussion tools are available for use in educational setting...
A Corpus of Online Discussions for Research into Linguistic Memes
We describe a 460-million-word corpus of online discussions. The data are collected from public news websites and community-of-interest Internet forums, and are designed to support research on the propagation of socially relevant ideas, a.k.a., “memes.” A structural and statistical description of the corpus is given, and the employed methods of website monitoring, collection, and extraction are ...
Journal
Journal title: Applied Corpus Linguistics
Year: 2022
ISSN: 2666-7991
DOI: https://doi.org/10.1016/j.acorp.2022.100017